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ABSTRACT 

The robustness of "Student's" t-test with regard to the 
assumption of normality is investigated. This is accomplished 
by empirically developing the distribution of a new statistic 
from a large number of samples of varying sizes from the Beta 
and the Gamma distributions using different values of the 
parameters. The various significance levels of this new 
distribution are then compared with the corresponding signifi- 
cance levels from "Student's" t distribution using an IBM 360 
computer. A comparison of the distribution frequencies with 
the standard normal distribution frequencies is also presented. 
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INTRODUCTION 



I . 

In practical work, an Operations Research Specialist is 
generally involved with the problems of bringing the theoret- 
ical structures into some degree of correspondence with the 
situations of practical experience. One of the common methods 
of achieving this objective is by hypothesis testing. The 
hypothesis testing method makes use of a variety of statisti- 
cal techniques and procedures involving both parametric and 
non-parametric tests. In the derivation of most of the para- 
metric tests, it is usual to assume a form of mathematical 
model involving some specific probability distribution and 
then to select some statistical criterion that is sensitive to 
change in the specific factors tested. This type of criterion 
is generally known as the "power of the test". 

However, another desirable requirement of any statistical 
test is that it be insensitive to change in the underlying 
assumptions. This property of the test is generally referred 
to as the "robustness" of the test. Common questions probing 
into the robustness of a test are: When the actual distribu- 

tion is not known, which statistical tests can be used with 
less hesitation?; and What percentage of error would be 
incurred if the underlying assumptions are violated? Past 
research and subsequent literature indicate that the property 
of robustness is generally not satisfied by the parametric 
tests. This is substantiated by research conducted by G.E.P. 

Box [7] and R. C. Geary [8], 

One of the most common parametric tests used for hypothesis 
testing is "Student ' s" t-test. This test was developed by 
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William Sealy Gosset in 1908 when he was working for Guinness 
Brewery in Dublin, Ireland. The discovery of the t-test was 
a direct result of the peculiar problems of brewing with its 
variable material and its susceptibility to temperature 
changes. A number of experiments conducted at the brewery 
emphasized the limitations of large sample theory and revealed 
the necessity for a correct method of treating small samples. 
It was, therefore, the circumstances of Gosset' s work which 
led to his discovery of the Theory of Errors and the distri- 
bution of the Sample Standard deviation, which was developed 
later into the well known t-test. 
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II . "STUDENT'S 1 t DISTRIBUTION 

As was mentioned earlier, the Theory of Errors finds its 

origin in the fact that the accuracy of the mean of a number 

of observations may be estimated from the discrepancies observed 

among the individual values used in obtaining the mean. When 

the samples are drawn from a normal population, having a 

2 

variance (mean squared deviation) equal to c , it can be 

shown that the mean of N observations x is also normally dis- 

2 2 

tributed with variance o_ = — . Thus if the variance of the 

x N 

mean is known, then the desired information regarding the 
distribution of the mean can be easily determined. In order 
to test a hypothesis that the population has mean y / one 

would merely need to calculate t = and then the integral 

-+-2 

oo 



I = 1 



;srJ 



dt would give the probability that a more 



discrepant value would occur. If the value of I so calculated 
turns out to be a small quantity, such as .01, one should con- 
clude with some confidence that the hypothesis was not, in 
fact, true of the population sampled. 

In a majority of the cases in which such tests are requir- 
ed, one can find no prior knowledge of the variance of the 
population. The variance of the population can be estimated. 



however, from the sample itself by the following relationships: 

2 i N 

If X-^ ,X 2 , . . . /X N represents a sample; then s = Z (X^-x) 

N-l j_ = i 



2 i N 

is an estimate of the unknown variance a , and x = i Z X. is 

N i=1 1 



the sample mean and an estimate of the population mean. 
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W. S. Gosset, in his fundamental paper of 1908, showed 

that even though s as calculated above is a good estimate 

of o , it would be erroneous to assume that the statistic 

t = — would be normally distributed or that the signifi- 

S //N 



cance of an observation could be accurately tested by using the 

2 

normal probability theory. Assuming the independence of S 

and x , Gosset was able to develop the exact distribution of 

this statistic, which in its modern form is known as "Student's" 

t distribution. It states that if X and U are two independently 

distributed random variables such that X is normally distributed 

2 

with mean y and variance a , and U has chi-square distribution 
with N degrees of freedom, then the ratio t = //u/N has 

the distribution of "Student's" t, and its density function is 
given by 



f (t) 



r<S±h 



, M 

W r(|) 



t^ 

u + 



N+l 



where -°°<t< co 



From this definition, it can be seen that "Student's" 
distribution depends on N only and is independent of both 
parameters of the normal distribution sampled. It is further 
noted that the derivation of "Student's" t distribution 
depends on the fact that the random variable X be drawn from 
a normal population. 

One of the common uses of "Student's" t-test is to compare 
the mean of a population with some standard. In practical 
problems, however, it is often the case that the distribution 
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of the population being sampled is not known or is not normal. 
It is therefore desirable that some measure of the degree of 
error be obtained for "Student ' t-test when the assumption 
of normality of the sample is violated. 

Literature reveals that a great deal of research has been 
directed towards many aspects of "Student' s" distribution. In 
1925, R. A. Fisher very stringently verified the independence 
of the sample mean x and the sample variance s^. This was 
one of the basic assumptions in the development of "Student's" 
distribution [2]. In 1936, R. C. Geary investigated the 
distribution of "Student's" t ratio for the samples drawn from 
a slightly asymmetrical universe by developing asymtotic formu- 
lae for the moments of the t distribution. He showed that for 
symmetrical, non-normal populations, "Student! s" distribution 
gives more accurate results than skewed sampling distributions. 
(4) In 1948, A. K. Gayen, using the moments techniques, 
studied the behavior of "Student's" t and obtained results 
similar to those obtained by R. C. Geary in 1936 [5] . In more 
recent work, Cucconi 0 developed a simple relation between the 
critical values of "Student's" t and the degrees of freedom. 
Through this relation, in addition to deriving the critical 
values of t for any number of degrees of freedom, he showed 
that it is possible to substitute "Student' s" criteria in the 
verification of statistical hypothesis more easily than using 
a standardized normal deviation [9] . These are but a few 
examples of the diversified attention given to "Student's" 
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t-test. The aim of this paper is to investigate the robustness 
of "Student's" t— test when the sampling data is drawn from 
the Beta and the Gamma distributions. 
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III. METHODS AND PROCEDURES 



In order to investigate the robustness of t-test, it was 
decided to select two well known distributions with a wide 
scope of application in the real world. The two distributions 
selected are the Gamma and Beta distributions. A brief 
description of these distributions follows. 



A. GAMMA DISTRIBUTION 

A continuous random variable Y is said to have the Gamma 
distribution if its density is 



f (y) = 



T (a) 3 a 



IX 

1 y a e ^ , 0 <y< c 



= 0 , otherwise 

It should be noted that this is a two parameter family of dis- 
tributions with parameters a and 3. The distribution is only 
defined when the value of a>-l/ and the value of B>0. The 

mean and variance of this distribution are respectively aB 
2 

and aB . The Gamma distribution is frequently applied to the 
solution of the queuing and inventory control problems. A 
graphical representation of the Gamma distribution for various 
values of its parameters is given in Appendix A. 

One of the special and most often used family of Gamma 
distributions is called the Erlang distribution. In order to 
obtain this distribution, the values of the parameter a in the 
Gamma distribution are restricted to positive integer values 
and this positive integer is denoted by K. If one replaces 
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1 

3 by j , then the distribution becomes 

K-l -Ay 

f (y) = - ^y) ® for y>0 

(K-l)! ~ 

= 0 , otherwise 

It is possible to show that the latter f (y) can be generated 
by the sum of K variates Z having the exponential density 

f(z) = A e when z>0 

= 0 , otherwise 

It should be noted that when K = 1, we have the exponential 

density as a special case. The mean, variance and mode of the 

K K K-l 

Erlang distribution are ^2 and ^ respectively. The great 

usefulness of this distribution stems from the fact that it is 
a large family of distributions permitting only non-negative 
values. The waiting time and the service time distributions 
in the queuing theory can, therefore, be reasonably approximated 
by an Erlang distribution. 

B. THE BETA DISTRIBUTION 

A continuous random variable Z is said to have a Beta 
distribution when 

f(z) = r . z a-1 (l-z) e-1 when 0<z<l 

r (a) r (3) 

= 0 , otherwise 

This density is a function of the two parameters a and 3, both 
of which are positive constants. The mean and variance of 
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this distribution are given by an< 3 



(g+1) (B+l) 



A 



(a+e+2) 2 (a+3+3) 



graphical representation of the Beta distribution for various 
values of the parameters in furnished in Appendix B. 

It may be pointed out that when a=3=0 the distribution 
becomes the uniform distribution over the unit interval . The 
Beta distribution, though in general defined over the unit 
interval, can also be defined over any interval (a, b) as 
follows 



Primarily because the Beta distribution is defined on a 
finite interval, it is widely used in the study of critical 
path scheduling techniques such as PERT. It may be recalled 
that PERT is a method of reflecting the uncertainties assoc- 
iated with development-type tasks, and its use generally 
results in providing a more realistic outlook for task accom- 
plishment than the conventional systems provided in the past. 
In order to obtain the critical path through the network, it 
is necessary that a distribution defined between the pessimis- 
tic and optimistic time estimates adequately represent the 
distribution of time required to perform an activity. Since 
the Beta distribution restricts the probability distribution 
to a finite range for varying values of the parameters, it is 
widely used in the critical path scheduling. 



f (z) = £ (z-a) 01 (b-z) ^ where a<z<b 



0 , otherwise 



where E, = (z-a) a (b-z) ^dz 




a 
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c. 



GENERATION OF DISTRIBUTIONS 



The inverse probability integral transformation technique 
is used to produce random variates having the exponential 
distribution. According to this technique, the cumulative 
distribution function for the uniform probability density 
function on the interval (0,1) is equated to the cumulative 
distribution function for the desired probability density 
function in order to obtain the necessary transformation. 

Thus if RN denotes a random variable on the interval (0,1), 
and X is the desired parameter of the exponential distribution; 
then the random variable obtained from the transformation 

X = 7 _ . is exponentially distributed with a mean y = i. 

X X 



1 . Erlang Distribution 

In order to generate the Erlang distribution, an 
extension of the above procedure is utilized by first generat- 
ing exponential random variables with mean i . Then K of 

X 

these exponential variates are summed using the relationship 



X 



N 

_ j An (RN) j to produce the desired Erlang distribution 
• “ A 



i=l 



with mean = ^ and variance ^ , 

x x^ 



2 . Beta Distribution 

The Beta distribution is generated using the following 
theoretical ideas. Consider two random samples X^,X 2 ,...,X 
and Y ]/ Y 2 ' * * * /Y m suc ^ that the random variables X and Y are 
identically distributed exponential with parameter X . If we 
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n m 

form the statistics Z^=Z and Z 2 =Z then 

i=l i=l 

Z 1 ^ r * z l ; n,y * 

Z 2 - r ( z 2 ; m,y ) 

and their joint density function is given by 

z l+z 2 

n-1 ra-1 -( n — ) 

Zi z 2 e 

g(z ,z ) = , 0<z n < 0 ° and 0<Zj<°° 

T (n) T (m)y in+n 

z i 

Now using the transformation V7, = ± and = Z.+Z^, the 

1 z 1 +z 2 2 12' 

joint density of and W 2 is given by: 

, . , ( Wl w 2 ) n “ 1 w 2 in ~ 1 (i- Wl ) ra_ l "~ 

h(w^ 7 w 2 ) = — : e ^ • w„ 



T (n) T (m) y 



m+n 



Integrating over the domain of W 2 , the marginal density of W^; 
h(w 1 ) = j o h (w 1 / w 2 ) dw 2 



i.e. , h (w^) = 



n-1 .m-1 

W-l (1-Wn) 



T (n) T (m) y 



n+m 



w 0 

o _ Z 

n+m-1 *r r 
w~ e H dw. 



Using the substitution U = W 2 /y and dW 2 = y du we get: 



n-1 ,, m-1 

w, ( 1— W-i ) f 00 

h (w, ) = L_ 

1 r(n)r(m) 



u n+m-l e -u du 



_ r (n+m) ,n-l M ^m-1 

' TWT57 w i (1 w l’ 
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has the Beta distribution 



We observe that 



Y +Y 
12 



with a=n, 3 = 111 , and this Beta distribution is independent of 

the parameter of the exponential distribution provided that 

the two Gamma distributions are generated by random numbers 

having identical exponential distributions. Thus to generate 

the Beta distribution, two Gamma distributions with parameters 

^ K l 7 ^0 and are generated using the above techniques. 

Y-, 

From these distributions, the ratio W-, = — — is formed to 

1 y 1+ y 2 



K 1 

obtain a Beta distributed random variable with mean = — — — 

k 1+ k 2 

k i K ? 

and variance x . 

(K 1 +K 2 ) 2 (K 1 +K 2 +1) 



Do SIMULATION TECHNIQUES 

The accuracy of generating random observations from a 
probability distribution depends upon the characteristics of 
the uniform random number generator. At least the superficial 
characteristics of the random number generator are ascertained 
by the use of chi-square test for goodness-of-f it at 95% and 
99% significance levels. For the chi-square test, the unit 
interval is divided into fifty equal intervals. Each generated 
random number is assigned to one of the fifty categories 
according to its size. From this data, a measure of the dis- 
crepancy existing between observed and expected frequencies is 

. ? T (o • -e • ) 2 

obtained using the relationship X = Z v i i' where o- 

i=l e. 

denotes the observed frequency of the ith category, and e^ 
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denotes the expected or theoretical frequencies of this occur- 
ence; T denotes the total number of categories. The number of 
degrees of freedom for the statistics is equal to T-l, where 
T is the number of categories or classes. 

For the purpose of this project, four simulated distribu- 
tions were obtained for each type of distribution sampled. 

Each simulation utilizes 5000 samples and a fixed sample size. 
In the discussion throughout this paper, sample size is referr- 
ed to as the number of data points within each sample used to 
calculate the t statistic and the number of samples denotes the 
number, of replications for fixed values of the parameters with- 
in each cycle. In order to obtain different shapes of the 
distribution, five different sets of values of the parameters 
are then used within each simulation. 

Since the basic techniques and procedures employed in 
each of the four simulations for the Erlang and the Beta 
distributions are essentially the same, it is considered 
adequate to discuss the methodology of one simulation for each 
distribution in detail. 

1 . Erlang Distribution 

Simulation 1 for the Erlang distribution consists 
of four steps. Step I consists of five cycles. For each 
cycle, 5000 samples of size 5 are independently generated 
from the Erlang distribution using the values of the parameters 
as shown in Table 1 below. For the purpose of discussion, 
these sampling distributions are referred to as y(K,A) 
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TABLE 1 



Cycle 


Cycle 1 


Cycle 2 


Cycle 3 


Cycle 4 


Cycle 5 


Volue of 
Porameters 

Y (K , X ) 


K = 4 
X = 3 


K =5 
X =3 


K =6 
X =3 


K = 7 
X =3 


K =8 
X =3 


Mean 


1.333 


1 .667 


2.0 


2. 333 


2.667 


Standard 

Deviation 


.6669 


.7408 


.8200 


.8624 


.9368 



To illustrate the use of Table 1, 5000 samples of 
size 5 are generated during cycle 1 using parameters K =4 , X 
=3. The mean and standard deviation of this Erlang distri- 
bution are given in rows 3 and 4 of Table 1. During Cycle 2, 
another 5000 samples of size 5 are independently generated 
from the Erlang distribution using parameters K =5, X =3. The 
mean and standard deviation of this distribution are 1.667 
and .7408 respectively. These values are furnished in rows 3 
and 4 of Table 1 under the heading Cycle 2. The same process 
is repeated for Cycles 3, 4, and 5 using the appropriate 
parameters . 

X-y 

In Step II, for each sample the statistic t = 



is calculated where x is sample mean, s is sample standard 
deviation, N = sample size (5) , and U is population mean (1.33) . 
The absolute values of this statistic are then compared with 
the corresponding values from the "Student's" t-distribution 
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with N-l degrees of freedom and at significance levels a=.8, 
.5, .2, .1, .05, .02, .01, and .001. These values of 



"Student's" t-distribution with varying degrees of freedom 
and significance levels are generally referred to as the 
critical values t M of t-distribution and are the solutions to 

ot 



the equation 1-^:= 



'ta 



f(t)dt for various values of alpha. The 



number of t statistics whose absolute values are greater than 
the corresponding critical values at each significance level 
are recorded for comparison purposes. 

In order to fully understand the comparisons between 
the actual significance levels and the significance levels 
obtained from the Erlang distribution, it is necessary that 
some knowledge regarding the comparisons between the distri- 
bution frequencies be known. This is accomplished in Step III 
by transforming the total number of data points for the Erlang 

distribution in each cycle (5000 X 5 = 25,000) using the trans 
X.--U 

formation , i = 1 , 25000, where y and a are the sampling 

a 

distribution mean and standard deviation respectively. The 

transformed distribution is called y(0,l). It may be pointed 

out that if the sampling distribution were normal, with mean y 

X-y 

and standard deviation a then this transformation — would 
reduce the sampling distribution to what is called Standard 
Normal, mean zero, variance unity. This is generally denoted 
by N(0,1). The transformed distribution, Y(0,1) is then 
divided into four intervals of width unity on either side of 
assumed mean of zero, and the frequency counts within each 
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interval are recorded. These counts are then compared with 
the expected number of frequency counts within each interval 
assuming that the sampling distribution was normal. 

In Step IV, the chi-square goodness-of-fit test, using 
a significance level of 0.05, is conducted on the results of 
the observed significance levels and the distribution frequen- 
cies as calculated in Steps II and III above. This concludes 
Cycle 1 of Simulation 1. For Cycles 2, 3, 4, and 5, the entire 
procedure as outlined above is repeated by changing the para- 
meters of the Y (K „^) distribution in accordance with Table 1. 

Simulations 2, 3, and 4 for the Erlang distribution 
use exactly the same techniques as outlined under Simulation 
1 except that the sample sizes are 10, 15, and 20 respective- 
ly. 

2 . Beta Distribution 

Simulation 1 for the Beta distribution is conducted 
in four steps. Step I consists of five cycles which are shown 
in Table 2. For each cycle, 5000 samples of size 5 are inde- 
pendently generated from each of the two Erlang distributions 
using fixed values of the parameters as shown in Table 2. 

These distributions are referred to as y^(K^,X) and y 2 ^ 2 '^ ’ 
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TABLE 2 



Cycle Number 


Cycle 1 


Cycle 2 


Cycle 3 


Cycle 4 


Cycle 5 


Values of 
Parameters 


K, =4 


K. =4 


K, =4 


K i = 4 


K, =4 


r,( K,,X ) 


X = 3 


X = 3 


X = 3 


II 

o* 


X ■ 3 


Volues of 
Parameters 


II 

OJ 

X 


K 2 * 5 


K 2 ■ 6 


K 2 = 7 


00 

II 

CsJ 

X 


r z (k 2 , X ) 


X s 3 


X *3 


X * 3 


X * 3 


II 

C Pi 


Mean (Beta) 


.5 


.44 


.40 


.36 


.33 


STD. DEV. (Beta) 


.167 


.158 9 


.150 


.140 


.1329 



To illustrate the use of Table 2, 5000 samples of 



size 5 are generated for Yi(K^,X) distribution using the para- 
meters =4 and X =3. Another 5000 samples of size 5 are 
independently generated for Y2 ( k 2'^ distribution using para- 
meters =4 and ^ =3 as shown in Table 2. During Cycle 2, 

5000 samples of size 5 for each of the Yi( k i*M and Y2( K 2'^ 
distributions are independently generated using the parameters 
K-^ -4 , X =3 and =5 , X =3 respectively. These values are 
found in Table 2 under the heading Cycle 2. A similar process 
is repeated for the remaining Cycles 3, 4, and 5. 

For each cycle, the y^(K^,X) and Y2^2'^^ distributions 
are combined using the relationship Y;l (K^/XJ/fy^ (K^ i X) + 

Y2( K 2'^^ to f orm five 5000 samples of size 5 for the Beta 

. . Ki Kt Ko 

distribution having mean = - — — — and variance » 

K l +K 2 (K 1 +K 2 ) 2 (K 1 +K 2 +1) 

The actual values of these calculations for each cycle are 

shown in Table 2. It should be apparent that each cycle 

generates 5000 samples of size 5 from a different Beta 

distribution. 
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The remaining Steps II, III, and IV of this simula- 
tion are identical with Steps II, III, and IV of Simulation 
1 under the Erlang distribution. Simulations 2, 3, and 4 for 
the Beta distribution use exactly the same techniques as 
outlined in Simulation 1 except that the sample sizes are 10, 

15, and 20 respectively. 

In addition, the effects of changes in the variance 
of the Beta distribution, when the mean is kept constant, are 
also investigated. This is accomplished by manipulating the 
values of the y^tK^,!) and y parameters that result in 

Beta distributions having the same mean but different variances. 
The sets of values of these parameters used during this simu- 
lation are furnished in Table 3 below. 



TABLE 3 



Cycle # 


Cycle 1 


Cycle 2 


Cycle 3 


Cycle 4 


Cycle 5 


Parameters of 


K| '2 

X =3 


K, -4 
X s 3 


K, =6 
X s 3 


K, = 8 
X * 3 


K, =10 
X = 3 


Parameters of 

Y,/Ki-,X) 


K 2 =2 
X *3 


K 2 = 4 
X * 3 


K 2 =6 
X * 3 


K 2 = 8 
X = 3 


k 2 = 10 

X = 3 


Mean (Beto) 


.5 


.5 


.5 


.5 


.5 


STD. DEV. (Beto) 


.036 


.022 


.0195 


.013 


.011 
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IV. RESULTS 



The results of the various simulations for the Erlang and 
Beta distributions are furnished in tabular form in appendices 
C, D, E, F, G, and H. Appendix C contains Tables 4, 5, 6, and 
7. These tables contain the frequency comparisons of the dis 
tributions obtained in these simulations and from Standard Nor- 
mal designated N(0,1), when samples of size, 5, 10, 15, and 20 
are used respectively. Tables 8, 9, 10, and 11 in Appendix D 
contain similar results obtained from the use of Beta distribu- 
tions. Appendix E contains Tables 12 and 13. These tables 
have the comparison of the frequency distributions from N(0,1) 
and the transformed Beta (0,1) when the mean of the distribution 
is kept constant, and its variance is manipulated. 

Appendices F, G, and H contain five tables each. These 
tables provide the necessary comparisons between the actual 
and observed significance levels for a fixed mean and 4, 9, 14, 
and 19 degrees of freedom. Tables 14, 15, 16, 17, and 18 in 
Appendix F pertain to Erlang distribution and contain the 
comparisons of the actual and the observed significance levels 
when samples of size 5, 10, 15, and 20 are used respectively. 
Tables 19, 20, 21, 22, and 23 in Appendix G contain similar 
results for the Beta distributions. Tables 24, 25, 26 27, 
and 28 in Appendix H have the comparisons of the significance 
levels when the variance of the Beta distribution is manipu- 
lated for a fixed mean. 

Since there are but two different table formats used to 
illustrate the results, it is considered appropriate to discuss 
the use of only one table of each type before proceeding with 
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the discussion of the results. Tables of the first type, 
which give frequency comparisons, are used in Appendices C, 

D, and E. The second type of table is used to provide the 
comparisons of sigificance levels, and these tables are 
contained in Appendices F, G, and H. 

As mentioned earlier. Table 4 pertains to the Erlang 
distribution. This table is divided into two parts. Part I 
furnishes the means and the standard deviations of the sampling 
distributions when the samples of size 5 are used. The means 
and standard deviations for each distribution are identified 
by means of index numbers. For example. Index (1) pertains 
to the Erlang distribution with mean 1.33 and standard devia- 
tion .6669. Part II of this table furnishes the frequency 
distributions of each Erlang given in Part I and the frequency 
distributions of N(0,1). In order to relate the mean and 
variance with the corresponding frequency distributions, the 
appropriate index should be consulted. For example, if one 
wishes to compare the frequency distribution of an Erlang with 
mean 2.0 and standard deviation .82, when using samples of 
size 5; one should consult Appendix C, Table 4. Corresponding 
to the above values, one would select Index 3 from Part I of 
Table 4 and obtain the frequency distribution from the row (3) 
in Part II of this table. In order to compare this frequency 
distribution with N(0,1), one would compare the corresponding 
values of row 3 with the row marked N(o,l). The asterisks on 
the distribution frequencies indicate that the distribution 
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was found to be significantly different than N(0,1) by chi- 
square goodness-of-f it test at 95% confidence level. 

The second type of table is found in Appendices F, G, and 
H. Table 15 in Appendix F pertains to the Erlang distribution 
with a fixed mean of 1.667. This table provides the comparison 
of the actual significance levels and the observed significance 
levels when the sampling population is Erlang with mean 1.667. 
This table furnishes these comparisons for four values of de- 
grees of freedom. These values are 4, 9, 14, and 19; they 
are found in the first column of the table. The format of this 
table is very similar to the standard t-table found in most 
text books. In order to make use of this table, one is re- 
quired to know the sample size and the population mean. For 
example, if one is interested in determining the comparison of 
significance levels; when sampling from Erlang distribution 
with mean 2.0 using sample size 15, one would consult Appendix 
F, Table 16 and obtain the required information in the, row 
marked 14. The asterisks on the observed significance values 
indicate that the results were found to be significantly 
different at 95% confidence level by the use of chi-square 
goodness-of-f it test. 

RESULTS OF ERLANG 

The results shown in Tables 4, 5, 6, and 7 indicate that 
the Erlang frequency distributions are significantly different 
than the N(0,1) at 5% level. A comparison of the observed 
significance levels with the actual significance levels in 
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Table 14, Appendix F reveals that when an Erlang distribution 
with mean 1.333 is used the observed alpha values are signifi- 
cantly different from the actual alpha values. For example, 
for Sample Size 5, corresponding to the actual values of a = 

.5, .2, and .05; the observed values are a = .034, .004, and 
.0004 respectively. It is further noted that as the sample 
size is increased to 10, 15, and 20, the corresponding observed 
values are zero in each case. This means that all the absolute 
values of t statistic are smaller than the critical values. 

From the results in Table 15, one observes that for 
sample size 5 at a = .5, .2, .05, the observed values are a = 
.0292, .002, and .0002 respectively. Each of these values is 
smaller than the corresponding values in Table 14. The results 
for sample size 10, 15, and 20 are the same as found in Table 
14. The results in Tables 16 and 17 indicate that for sample 
size 5, the corresponding observed values in each case are 
a = .0292, .0034, .0002 and .0268, .0028, .0002 respectively. 

The alpha values for sample sizes 10, 15, and 20 remain zero. 

In examining the graphs of these distributions, one ob- 
serves that as the mean of the distribution is increased; the 
distribution, though flatter than the normal distribution, 
becomes more nearly symmetric around the mean. However as the 

mean of the distribution is increased, the variance ( ) is 

X2 

also increased. Consequently, when sampling from this distri- 
bution, the absolute values of the t statistic, X-)J , are 

S//R 
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smaller than the absolute values of this statistic when sampl- 
ing from the normal population. Furthermore, these values 
decrease as the sample size is increased. 

From the foregoing results, it is concluded that "Stu- 
dent's" t-test is sensitive to the Erlang distribution. It is 
further observed that when sampling from an Erlang distribu- 
tion, the resulting t statistic has significantly shorter tails 
than the "Student's" t distribution. It would seem, therefore, 
that the resulting t statistic is a form of leptokurtic curve. 
RESULTS OF BETA DISTRIBUTION 

Results in Tables 8, 9, 10, and 11 reveal that except 
when the sample size is 20; and the distribution is symmetric 
around the mean (.5), the Beta distribution frequencies are 
significantly different from the N(0,1). A comparison of the 
results in Table 19, Appendix G reveals that when sampling 
from the Beta distribution with mean .5, the observed alpha 
values in most cases are not significantly different from the 
actual alpha values. For example, for sample size 5, the 
observed alpha values are only significant at a = .8 and .5. 
However, when the sample size is increased to 10, 15, and 20; 
the results are only significant at a = .8. This suggests 
that when sampling from a symmetric Beta distribution, using 
samples of size 5 cr greater, the "Student's" t-test yields 
results that are comparable with results obtained from normal 
sampling distributions. 

The results in Table 20 reveal that when sampling from 
the Beta distribution with mean .44, the observed alpha values 
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for sample size 10 remain unchanged. The observe alpha values 
using sample size 5 are significantly different at a = .8, 

.5/ and .2. The observed alpha values for sample size 20 are 
significant at a = .8, .5, .1 and .05. Comparing similar 
results in Tables 21 and 22, one observes that as the mean of 
the sampling distribution is varied to the left of the mean, 
the use of larger sample sizes (10, 15) yield observed alpha 
values that are comparable to the actual alpha values. In 
Table 23, one observes that when the sampling distribution has 
mean .33; i.e., it is significantly skewed to the right, the 
observed alpha values for sample size 5 are significantly 
different than the actual alpha values. The results for 
sample size 10 are reasonably comparable with the actual alpha 
values. 

From the foregoing, it is concluded that the "Student's" 
t-test is sensitive to changes in the shape resulting from 
changes in the mean of the distribution. When the Beta distri- 
bution is symmetric around the mean, the use of "Student's" 
t-test yields results that are comparable with those obtained 
when sampling from the normal population. It is further noted 
that most of the observed alpha values are smaller than the 
actual alpha values. This suggests that the values of the t 
statistic are smaller than the corresponding values of 
"Student's" t statistic. It would seem that the tails of the 
new t statistic are shorter than the tails of "Student's" t. 

The results of the Beta distributions with fixed mean 
and different variances are contained in Tables 24, 25, 26, 
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27, and 28. A similar investigation of these results indicates 
that for small sample size (5) the observed alpha values are 
significantly different than the true values. However, as 
the sample size is increased to 10, the difference between the 
observed and actual alpha values is significantly reduced. 

This suggests that "Student's" t-test is sensitive to changes 
in the variance when small samples are used. 
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V. CONCLUSIONS AND RECOMMENDATIONS 



The simulation described on the foregoing pages which 
investigated the robustness of "Student's" t-test has proven 
to be extremely realistic. The results of violating the 
assumption of normality by the use of the Beta and the Gamma 
distributions indicate that when the sampling distribution is 
symmetric around the mean, the use of "Student's" t-test gives 
results that are closely comparable with the values obtained 
by the use of normal distributions. This implies that 
"Student's" t-test is reasonably robust when it is used with 
symmetric populations. . The results also indicate that in the 
case of skewed distributions, the observed alpha values are 
significantly different from the actual significance levels. 

It is further noted that in the case of symmetric distributions, 
the values of observed significane levels approach the true 
values as the sample size is increased. 

In the case of symmetrical distributions with a fixed 
mean and different variances, it is observed that the "Stu- 
dent's" t-test is sensitive to the change in the variance. The 
observed significance levels are considerably different from 
the true values as the variance of the sampling population is 
increased. However, as the sample size is increased, the 
effects of change in the variance are considerably reduced. 
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GRAPHS OF ERLANG DISTRIBUTIONS FOR 



foot cn cd ^ <x> to o 



APPENDIX B 




GRAPH OF BETA DISTRIBUTIONS Q = K, 
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GRAPH OF BETA DISTRIBUTIONS WITH MEAN = 0.5, 
DIFFERENT VARIANCE. 
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Table 4 

POPULATION Erlang 
SAMPLE SIZE 5 



Part I Population Parameters 



Mean 


1.33 


1.667 


o 


2.333 


2.667 


STD. DEV. 


.6669 


.74 


. 82 


.86 


.93 


Index 


(1) . 


(2) 


(3) 


(4) 


(5) 



Part II Comparisons of Distribution Frequencies 



. Inter- 
vals 


< 

y-3o 


y-o 


y-o 


y-o 


y+a 


y+o 


y+a 


> 

y+3a 


N(0,1) 
Y (0,1) 


74 


524 


3399 


8499 


8499 


3399 


524 


74 


Index 

(D* 


0 


0 


3581 


10575 


6991 


2793 


813 


247 


(2)* 


0 


10 


3653 


10250 


7244 


2820 


990 


233 


(3)* 


0 


26 


3699 


10161 


7262 


2841 


774 


237 


(4)* 


0 


62 


3797 


9743 


7469 


2948 


802 


179 


(5)* 


0 


75 


3824 


9762 


7390 


2490 


782 


177 
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Table 5 

POPULATION: Erlang 

SAMPLE SIZE: 10 



Part I Population Parameters 



Mean 


1.333 


1.667 


2.0 


2.33 


2.667 


STD. DEV. 


.6604 


.749 


.83 


.905 


.9755 


Index 


(1) . 


(2) 


(3) 


(4) 


(5) 



Part II Comparison of Distribution Frequencies 



Inter- 

vals 


< 

y-3a 


y-3a 


y-2a 


y-a 


y+a 


y+2a 


y+3a 


> 

y+3a 


N (0 , 1) 

y (0,1) 


149 


1049 


6799 


16999 


16999 


6799 


1049 


149 


Index 

(D* 


0 


1 


7294 


20977 


14043 


5495 


1670 


520 


(2)* 


0 


11 


7029 


20628 


14691 


5610 


1572 


459 


(3)* 


0 


39 


6854 

« 


20464 


15090 


5636 


1500 


417 


(4)* 


0 


46 


6702 


20443 


15197 


5819 


1447 


346 


(5)* 


0 


78 


6670 


20394 


15226 


5822 


1490 


320 
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Table 6 

POPULATION: Erlang 

SAMPLE SIZE: 15 



Population Parameters 



Mean 


1.33 


1.667 


2.0 


2.333 


2.667 


STD. DEV. 


.6725 


.7677 


.8466 


.9173 


.9983 


Index 


(1) . 


(2) 


(3) 


(4) 


(5) 



Part II Comparisons of Distribution Frequencies 



Inter- 

vals 


< 

y-3a 


y-3cr 


y-2a 


y-a 


y+o 


y+2cr 


y+3o 


> 

y+3a 


N(0.1) 
Y (0,1) 


224 


1574 


10199 


25499 


25499 


10199 


1574 


224 


Index 

(D* 


0 


0 


10018 


31845 


21764 


8223 


2423 


724 


(2)* 


0 


4 


9739 


31562 


22509 


8283 


2216 


687 


(3)* 


0 


25 


9802 


31101 


22752 


8495 


2272 


553 


(4)* 


0 


54 


9750 


30661 


23129 


8783 


2139 


484 


(5)* 


0 


75 


9763 


30455 


23504 


8666 


2082 


455 
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Table 7 

POPULATION: Erlang 

SAMPLE SIZE: 20 



Part I Population Parameters 



Mean 


1.33 


1.667 


2.0 


2.333 


2.667 


STD. DEV. 


.6825 


.7710 


.8536 


.9232 


. 9957 


Index 


(1) 


(2) 


(3) 


(4) 


(5) 



Part II Comparisons of Distribution Frequencies 



Inter- 

vals 


1 A 

U> 

Q 


y-3a 


y-2a 


y-a 


y+a 


y+2a 


y+3a 


> 

y+3a 


N(0,1) 

Y(0,1) 


299 


2099 


13599 


33999 


33999 


13599 


2099 


299 


Index 

(D* 


0 


0 


12775 


43055 


29172 


11113 


3142 


943 


(2) 


0 


5 


12569 


42147 


30198 


11222 


3055 


809 


(3)* 


0 


30 


12625 


41768 


30649 


11277 


2897 


754 


(4)* 


0 


69 


12582 


41276 


31091 


11553 


2798 


631 


(5)* 


0 


109 


12587 


40938 


31459 


11636 


2705 


566 
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Table 8 

POPULATION: Beta 

SAMPLE SIZE: 5 



Part I Population Parameters 



Mean 


.5 


.44 


.40 


.36 


.33 


STD. DEV. 


.167 


.1589 


.50 


.140 


.1329 


Index 


(1). 


(2) 


(3) 


(4) 


(5) 



Part II Comparison of Distribution Frequencies 



Inter- 

val 


< 

y-3a 


y-3a 


y-2a 


y-o 


y+a 


y+2a 


y+3a 


> 

y+3a 


N (0 ,1) 
Beta (0 ,-n 


74 


524 


3399 


8499 


8499 


3399 


524 


74 


(D* 


0 


411 


3879 


8062 


8333 


3896 


419 


0 


(2)* 


0 


298 


3894 


8414 


8146 


3670 


575 


3 


(3)* 

1 


0 


194 


3890 


8739 


7987 


3515 


660 


15 


(4)* 


0 


151 


3942 


8919 


7849 


3445 


656 


38 


(5)* 


0 


100 


3919 


9111 


7736 


3382 


696 


56 
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Table 9 



POPULATION: Beta 

SAMPLE SIZE: 10 



Part I Population Parameters 



Mean 


.5 


.44 


.40 


.36 


.33 


STD. DEV. 


.167 


.157 


.148 


.1397 


.132 


Index 


(1) 


(2) 


(3) 


(4) 


(5) 



Part II Comparison of Distribution Frequencies 



Inter- 

val 


< 

y-3a 


y-3a 


y-2cr 


y-o 


y+a 


y+2a 


y+3o 


> 

y+3a 


. N(o,l) 
8eta (0,1) 


149 


1049 


6799 


16999 


16999 


6799 


1049 


% 

149 


Index 

(D* 


0 


802 


7637 


16467 


16467 


7726 


935 


0 


(2)* 


0 


629 


7841 


16813 


16123 


7418 


1164 


10 


(3)* 


0 


481 


7796 


14386 


15766 


7139 


1387 


45 


(4)* 


0 


325 


7879 


17801 


15558 


6916 


1457 


64 


(5)* 


0 


261 


7789 


18124 


15378 


6931 


1396 


121 
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Table 10 

POPULATION: Beta 

SAMPLE SIZE: 15 



Part I Population Parameters 



Mean 


.5 


.44 


.4 


.36 


.33 


STD . DEV . 


.166 


.1577 


.1477 


.1385 


.1308 


Index 


(1) . 


(2) 


(3) 


(4) 


(5) 



Part II Comparisons of Distribution Frequencies 



Inter- 

val 


< 

y-3a , 


y-3a 


y-2a 


y-a 


u+a 


u+2a 


y+3a 


> 

y+3a 


; N(0,1) 
Beta (0 ,1) 


224 


1574 


10199 


25499 


25499 


10199 


1574 


224 


Index 

(D* 


0 


1225 


11574 


24441 


24621 


11812 


1327 


0 


(2)* 


0 


931 


11741' 

i 


25404 


23774 


11407 


1727 


16 


(3)* 


0 


683 


11820 


26111 


23470 


10762 


2105 


49 


(4)* 


0 


530 


11927 


26398 


23346 


10494 


2189 


116 


(5)* 


0 


429 


11841 


26916 


23037 


10283 


2275 


176 
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Table 11 

POPULATION: Beta 

SAMPLE SIZE: 20 



Part I Population Parameters 



Mean 


.5 


.44 


.40 


.36 


.33 


STD. DEV. 


.167 


.156 


.1469 


.138 


.1297 


Index 


(1) 


(2) 


(3) 


(4) 


(5) 



Part II Comparison of Distribution Frequencies 



Inter- 

vals 


< 

y-3cr 


y-3a 


CM . 
1 


y-a 


y+a 


y+2a 


y+3a 


> 

y+3a 


N (0 ,1) 

r - 

Beta (0 .11 


299 


2099 


13599 


33999 


33999 


13599 


2099 


299 


Index 

(D* 


0 


1641 


15458 


32592 


3271 


15798 


1780 


0 


(2)* 


0 


1252 


15714 


33539 


32161 


14858 


2447 


29 


(3)* 


0 


970 


15740 


34596 


31365 


14505 


2721 


103 


(4)* 


0 


766 


15774 


35284 


31087 


13905 


3005 


179 


(5)* 


0 


586 


15896 


35825 


30499 


13853 


3094 


247 
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Table 12 

POPULATION : Beta 

SAMPLE SIZE: 5 (Fixed Mean, Difference Variance) 



Part I Population Parameters 



Mean 


.5 


.5 


.5 


.5 


.5 


Variance 


.1036 


.022 


.0195 


.013 


.0108 


Index | 


| 


(2) 


(3) 


(4) 


(5) 



Part II Comparison of Distribution Frequencies 



Inter- 

val 


< 

y-3o 


y-3cr 


y-2a 


y-a 


y+a 


y+2a 


y+3a 


> 

y+3a 


N(0,1) 
Beta (0,1) 


74 


524 


3399 


8499 


8499 


3399 


524 


74 


Index 

(D* 


0 


166 


4394 


7941 


7764 


4572 


165 


0 


(2)* 


0 


397 


3908 


8075 


8336 


3846 


436 


0 


(3)* 


3 


430 


3584 


8573 


8456 


3659 


493 


2 


(4)* 


3 


439 


3573 


8399 


8492 


3597 


486 


11 


(5)* 


9 


431 


3456 


8496 


8575 


3581 


439 


11 
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Table 13 

POPULATION: Beta 

SAMPLE SIZE: 10 (Fixed Mean, Difference Variance) 



Part I Population Parameters 



Mean 


.5 


.5 


.5 


.5 


.5 


Variance 


.036 


.022 


.0195 


.013 


.0108 


Index 


(1). 


(2) 


(3) 


(4) 


(5) 



Part II Comparison of Distribution Frequencies 



Inter- 

vals 


< 

y-3a 


y-3a 


y-2o 


y-a 


y+a 


y+2a 


y+3a 


> 

y+3a 


N ( 0 , 1 ) 
Beta (0.1) 


149 


1049 


6799 


16999 


16999 


6799 


1049 


14 9 


Index 

(D* 


0 


399 


8788 


15650 


15758 


8963 


442 


0 


(2)* 


0 


838 


7653 


16251 


16653 


7811 


894 


0 


(3)* 


8 


861 


7361 


L6598 


16669 


7534 


959 


11 


(4)* 


15 


946 


7026 


L6749 


16940 


7318 


985 


21 


(5) 


12 


919 


6985 


L6834 


16964 


7226 


1033 


27 
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Table 14 

POPULATION: Erlang 

POPULATION MEAN: 1.33 



Degrees 

of 








VALUES 


OF i-a 






Freedom 


.2 


.5 


.8 


.9 


.95 


.98 


.99 


.999 


4 


.7416* 


.9660* 


.9960* 


.9982* 


.9996* 


1.0* 


1.0* 


1.0 


9 


.9586* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0 


14 


.9946* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0 


19 


.9996* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0 
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Table 15 

POPULATION: Erlang 

POPULATION MEAN: 1.667 



Degrees 

of 

Freedom 


VALUES OF 1-a 


.2 


.5 


.8 


.9 


.95 


.98 


.99 


.999 


4 


.7480* 


.9708* 


.9980* 


.9998’ 


: . 9998 * 


1.0* 


1.0* 


1.0 


■9 


.9650* 


.998* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0 


14 


.9952* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0 


19 


.9998* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0 
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Table 16 



POPULATION: Erlang 

POPULATION MEAN : 2.0 



Degrees 

of 

Freedom 


VALUES OF 1-a 


.2 


.5 


.8 


.9 


.95 


.98 


.99 


.999 


4 


.7466* 


.9708* 


.9966* 


.9994* 


.9998* 


1.0* 


1.0* 


1.0 


9 


.9608 


.999* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0 


.14 


.9958* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


19 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0 
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Table 17 

POPULATION : Erlang 

POPULATION MEAN: 2.33 



'' Degrees 
of 








VALUES 


OF 1-c 


l 






, Freedom 


.2 


.5 


.8 


.9 


.95 


.98 


.99 


.999 


4 


.7394* 


.9732* 


.9972* 


.9994* 


.9998* 


1.0* 


1.0* 


1.0 


9 


.9640* 


.999* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0 


14 


.9970* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0 


19 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0 
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Table 18 

POPULATION: Erlang 

POPULATION MEAN: 2.667 



Degrees 

of 

Freedom 


VALUES OF 1-a 


.2 


.5 


.8 


.9 


.95 


.98 


.99 


.999 


4 


.754* 


.9750* 


.9982* 


.998* 


1.0* 


1.0* 


1.0* 


1.0 


9 


.9656* 


.999* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0 


14 


.9962* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0 


19 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0* 


1.0 
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APPENDIX G 



Table 19 

POPULATION: BETA 

POPULATION MEAN : . 5 



Degrees 

of 

Freedom 


ACTUAL SIGNIFICANCE LEVELS 


. 8 


.5 


.2 


.1 


LO 

O 

• 


.02 


.01 


.001 


4 


.793* 


.485* 


.196 


.099 


.051 


.020 


.011 


.002 


9 


.795* 


.497 


.199 


.099 


.051 


.023 


.014 


.002 


14 


.807* 


.498 


.196 


.095 


.050 


.040 


.010 


.001 


19 


.809* 


.501 


.199 


.102 


.051 


.021 


.01 


.001 



49 



APPENDIX G 



Table 20 



POPULATION: Beta 

POPULATION MEAN: .44 



Degrees 

of 

Freedom 


ACTUAL SIGNIFICANCE LEVELS 


.8 


.5 


.2 


.1 


.05 


.02 


•— 1 
o 

• 


.001 


4 


.793* 


.480* 


.190* 


.099 


.050 


.021 


.Oil 


.001 


9 


.805* 


.053 


.200 


.102 


.050 


.021 


.011 


.001 


14 


.799 


.489* 


.193* 


.098 


.050 


.041 


.012 


.001 


19 


.805* 


.509* 


.200 


.109* 


.058* 


.023 


.02 


.001 
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Table 21 



POPULATION: Beta 

POPULATION MEAN: .40 



Degrees 

of 

Freedom 


ACTUAL SIGNIFICANCE LEVELS 


.8 


.5 


.2 


.1 


.05 


.02 


.01 


.001 


4 


.793* 


.488* 


.192* 


.096 


.050 


.020 


.010 


.001 


9 


.802 


.491* 


.192* 


.096 


.048 


.021 


.010 


.001 


14 


.803* 


.503 


.196 


.101 


.053 


.043 


.011 


.002 


19 


.791* 


.494* 


.189 


.094 


.046 


.018 


.01 


.002 
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Table 22 



POPULATION: Beta 

POPULATION MEAN: .36 



Degrees 

of 

Freedom 


ACTUAL SIGNIFICANCE LEVELS 


.8 


.5 


.2 


.1 


.05 


.02 


.01 


.001 


4 


.798 


.492* 


.204 


.098 


.053 


.020 


.010 


.001 


9 


.793* 


.494* 


.194 


.102 


.054 


.021 


.012 


.001 


14 


.795* 


.507* 


.204 


.100 


.051 


.043 


.011 


.002 


19 


.803* 


.501 


.196 


.094 


.050 


.022 


.01 


.001 
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Table 23 



POPULATION: Beta 

POPULATION MEAN: .33 



Degrees 

of 

Freedom 


ACTUAL SIGNIFICANCE LEVELS 


.8 


.5 


.2 


.1 


.05 


.02 


. 01 


.001 


4 


.801 


.504 


.205 


.107* 


.061* 


.023* 


.012* 


.003* 


9 


.793* 


.504 


.200 


.101 


.052 


.023 


.011 


.001 


14 


.796* 


.507* 


.209* 


.106 


.052 


.046 


.013 


.002 


19 


.794* 


.493* 


.193* 


.103 


. 0."4 


.023 


.02 


.001 
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Table 24 

Beta (Fixed Mean- Different Variance) 

.5 

.036 



| Degrees 
of 

Freedom 


ACTUAL SIGNIFICANCE LEVELS 


.8 


.5 


.2 


.1 


.05 


.02 


.01 


.001 


4 


.7860* 


.4796* 


.2084* 


.1160* 


.0682* 


.0314* 


.0176* 


.0012 


9 


.7952* 


.4870* 


.1948 


.1054 


.0562* 


.0244 


.0132 


.0018 



POPULATION: 
POPULATION MEAN: 
POPULATION VARIANCE: 
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Table 25 

POPULATION: Beta (Fixed Mean-Different Variance) 

POPULATION MEAN : . 5 

POPULATION VARIANCE: .022 



Degrees 

of 

Freedom 


ACTUAL SIGNIFICANCE LEVELS 


.8 


.5 


.2 


.1 


.05 


.02 


.01 


.001 


4 


.7874* 


.4842* 


.2002 


.1052 


.0538 


.0238 


.0118 


.0012 


9 


.8008 


.4934 


.1992 


.0998 


.0538 


.0220 


.0130 


. 0012 
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Table 26 

Beta (Fixed Mean-Different Variance) 
.5 

.0195 



Degrees 

of 

Freedom 


ACTUAL SIGNIFICANCE LEVELS 


.8 


.5 


.2 


.1 


.05 


• 

o 

to 


.01 


.001 


4 


.7908* 


.4914* 


.1948 


.1068 


.0556 


.0224 


.0102 


.0008 


9 


. 8012 


.5042 


.1956 


.0976 


. 0486 


.0220 


.0094 


.0008 



POPULATION: 
POPULATION MEAN: 
POPULATION VARIANCE: 
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Table 27 



POPULATION: Beta 

POPULATION MEAN : . 5 

POPULATION VARIANCE: .013 



Degrees 

of 

Freedom 


ACTUAL SIGNIFICANCE LEVELS 


.8 


.5 


.2 


.1 


.05 


.02 


1 

t — 1 

o 

• 


.001 


4 


. 7864* 


.4824* 


.1868* 


.0924* 


.0484 


.0188 


.0084 


.0002 


9 


. 7910* 


.4908* 


.1940 


.0970 


.0492 


.0196 


.0112 


.0018 
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Table 28 



POPULATION : Beta 

POPULATION MEAN: .5 

POPULATION VARIANCE: .0108 



Degrees 

of 

Freedom 


ACTUAL SIGNIFICANCE LEVELS 


.8 


.5 


.2 


.1 


.05 


.02 


.01 


.001 


4 


.8026* 


.4906* 


.2004 


.1 


.0498 


.0204 


.0104 


.0014 


9 


.7910 


.5030 


.2010 


.0974 


.0464 


.0172 


.0090 


.0008 
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Instructions for the Use of Computer Program 

The attached computer program was used to obtain the 
simulation results for the Beta distribution. The program, 
written in Fortran language, makes use of an IBM 360 computer. 
This program consists of four simulations, requires 47 minutes 
of execution time, and makes use of 505 K storage capacity. 

The program as presented in Appendix J, is self sufficient in 
that it does not require any external sub-routines or programs 
for its execution. The program contains its own random number 
generator with the variant seed characteristics. It enables 
the user to manipulate the various parameters without any 
reorganization of the program. 

To assist users in understanding and utilizing various 
facets of the program, a number of comment cards are used 
through-out the program. In order to illustrate the techniques 
of varying the parameters of the Erlang distribution or the 
number of cycles in the simulation, or the number of samples 
for the distributions, the following instructions could be 
useful. If one consults Appendix J, page 62, he should find 
that parameters B^ = 3 corresponds to X = 3, and the values of 
Kj_ and K 2 represent the values of parameter K in the Erlang 
distribution. In order to obtain five different Erlang distri- 
butions in each cycle of each simulation (as illustrated in 
Table 2) , one would set = 4 , = 3 and make use of the Do 

loop 777. Thus in order to change the number of cycles to 3, 
one would simply replace Do 777 IB=1,5 with Do 777 IB=1,3. 
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This program is designed to furnish 4 different sample 
sizes. This is accomplished by the first Do loop on page 62 ; 
i.e. , Do 778 IN=1,4. The sample size is determined by the 
value of the increment in statement, MR = MR + 5. If four 
samples of sample size 2, 4, 6, and 8 are required, one would 
simply change the statement MR = MR +2. If only two samples 
of 5000 each are desired, then one would replace Do 778 IN = 

1,4 by Do 778 IN = 1,2. The number of samples in each simula- 
tion are determined by the Do loop "Do 1000 II = 1,50. As is 
apparent from the dimensions of GAMMAl and GAMMA2 , the program 
for given values of and K 2 generates 100 samples of size 
MR which are Beta distributed. It calculates the new t 
statistic, compares the absolute values of this statistic 
with the critical values, and stores the beta variate in 
location called "STOR" . Utilizing a different seed, the entire 
process is repeated 50 times, thus yielding a total of 5000 
samples of size MR. After the generation of 5000 samples which 
are located in "STOR", the distribution frequency comparisons 
are performed; and the desired results are provided as indi- 
cated in Appendix J. The values of parameters are then 
incremented by the Do loop 778, and the entire process is 
repeated until 5 cycles are completed for the value of sample 
size. At the end of Simulation I, the sample size is incre- 
mented, and the entire process is repeated. 

A sample of the results of one simulation are furnished 
in Appendix J. It may be recalled that each simulation 
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consists of five cycles; for each cycle, the program furnishes 
the following information as shown in Appendix J: 

1. Transformed Beta frequency distribution denoted by (T) . 
N(0,1) frequency distribution denoted by (?) . 

Sample size denoted by (?) . 

Distribution mean denoted by (?) . 

The actual levels of significance denoted by ( 5 ) . 

The observed levels of significance denoted by (?) . 



2 . 

3. 

4. 

5. 

6 . 
7. 



The X 2 (1) statistic for each level of significance 
.95 



denoted by (?) . 

At the end of the simulation, a summary of the results is 

furnished which contains the actual alpha values, the observed 

2 

alpha values for each of the five cycles, X 95 values for one 

2 

degree of freedom, and the observed X (1) statistics for each 
significance level. The computer program for the Gamma 
distribution also utilizes the same techniques. This program 
is furnished in Appendix J. 
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COMPUTER PROGRAM FOR ERLANG DISTRIBUTION 
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The Beta frequency intervals are as follows 
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Consolidate Results of Simulation 
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